Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-identification

Abstract

RGB-infrared person re-identification is an emerging cross-modality re-identification task, which is very challenging due to the significant modality discrepancy between RGB and infrared images. In this work, we propose a novel modality-adaptive mixup and invariant decomposition (MID) approach for RGB-infrared person re-identification, aimed at learning modality-invariant and discriminative representations. MID designs a modality-adaptive mixup scheme to generate suitable mixed-modality images between RGB and infrared images, mitigating the inherent modality discrepancy at the pixel level. It formulates the modality mixup procedure as a Markov decision process, where an actor-critic agent learns a dynamical and local linear interpolation policy between different regions of cross-modality images under a deep reinforcement learning framework. Such a policy guarantees modality-invariance in a more continuous latent space and avoids manifold intrusion by corrupted mixed-modality samples. Moreover, to further counter the modality discrepancy and enforce invariant visual semantics at the feature level, MID employs modality-adaptive convolution decomposition to disassemble a regular convolution layer into modality-specific basis layers and a modality-shared coefficient layer. Extensive experimental results on two challenging benchmarks demonstrate the superior performance of MID over state-of-the-art methods.
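The local linear interpolation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract says the mixing policy is learned by an actor-critic agent, whereas here the per-region ratios are simply passed in by hand. The function name `regionwise_mixup`, the horizontal-stripe partition, and the `ratios` parameter are all assumptions made for illustration.

```python
import numpy as np


def regionwise_mixup(rgb, ir, ratios):
    """Blend two aligned images with one mixing ratio per horizontal stripe.

    rgb, ir: arrays of shape (H, W, C) with identical shapes.
    ratios:  1-D sequence of values in [0, 1]; ratios[k] is the weight of
             the RGB image in the k-th stripe (hypothetical parameterisation,
             standing in for a learned policy).
    """
    rgb = np.asarray(rgb, dtype=np.float64)
    ir = np.asarray(ir, dtype=np.float64)
    ratios = np.asarray(ratios, dtype=np.float64)
    if rgb.shape != ir.shape:
        raise ValueError("rgb and ir must have the same shape")
    if np.any(ratios < 0.0) or np.any(ratios > 1.0):
        raise ValueError("every ratio must lie in [0, 1]")

    height = rgb.shape[0]
    # Split the rows into len(ratios) nearly equal stripes.
    bounds = np.linspace(0, height, len(ratios) + 1).astype(int)

    mixed = np.empty_like(rgb)
    for k, lam in enumerate(ratios):
        top, bottom = bounds[k], bounds[k + 1]
        # Linear interpolation between the two modalities inside this stripe.
        mixed[top:bottom] = lam * rgb[top:bottom] + (1.0 - lam) * ir[top:bottom]
    return mixed


# Usage: a 4-row image pair split into two stripes with different ratios.
rgb_img = np.ones((4, 2, 3))
ir_img = np.zeros((4, 2, 3))
mixed_img = regionwise_mixup(rgb_img, ir_img, [1.0, 0.25])
```

With a ratio of 1.0 a stripe is copied from the RGB image unchanged, and with 0.0 it is copied from the infrared image; intermediate values give a per-region blend rather than a single global mixing coefficient.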

Similar resources

Supplementary Material for “RGB-Infrared Cross-Modality Person Re-Identification”

This supplementary material accompanies the paper “RGB-Infrared Cross-Modality Person Re-Identification”. It includes more details of Section 4, as well as extra evaluations of our proposed deep zero-padding method. 1. Details of Counting Domain-Specific Nodes In the third paragraph of Section 4.2 in the main manuscript, we quantify the number of domain-specific nodes in the trained network in ...

Query Based Adaptive Re-ranking for Person Re-identification

Existing algorithms for person re-identification hardly model query variations across non-overlapping cameras. In this paper, we propose a query based adaptive re-ranking method to address this important issue. In our work, negative image pairs can be easily generated for each query under non-overlapping cameras. To infer query variations across cameras, nearest neighbors of the query positive ...

Pose Invariant Embedding for Deep Person Re-identification

Pedestrian misalignment, which mainly arises from detector errors and pose variations, is a critical problem for a robust person re-identification (re-ID) system. With bad alignment, the background noise will significantly compromise the feature learning and matching process. To address this problem, this paper introduces the pose invariant embedding (PIE) as a pedestrian descriptor. First,...

Hierarchical Invariant Feature Learning with Marginalization for Person Re-Identification

This paper addresses the problem of matching pedestrians across multiple camera views, known as person re-identification. Variations in lighting conditions, environment and pose changes across camera views make re-identification a challenging problem. Previous methods address these challenges by designing specific features or by learning a distance function. We propose a hierarchical feature le...

View-Adaptive Metric Learning for Multi-view Person Re-identification

Person re-identification is a challenging problem due to drastic variations in viewpoint, illumination and pose. Most previous works on metric learning learn a global distance metric to handle those variations. Different from them, we propose a view-adaptive metric learning (VAML) method, which adopts different metrics adaptively for different image pairs under varying views. Specifically, give...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2022

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v36i1.19987